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PATENT 

ATTORNEY DOCKET NO: 01948/069002 
rARDT AC-CFJ T. SPECIFIC ENHANCER ELEM ENTS AND USES THEREOF 

Cross-Reference to Related Applications 
This application claims benefit from U.S. Provisional Application No. 
60/176,419, filed January 14, 2000 (now pending), hereby incorporated by 
reference. 

Background of the Invention 
The invention relates to reagents and methods for expressing gene 
products in cardiac cells or precursors to cardiac cells in vitro and in vivo. 

Adult mammalian cardiomyocytes do not de-differentiate or re-enter the 
cell cycle. After being placed into cell culture, neonatal cardiomyocytes soon lose 
their ability to proliferate. Several cell lines, including PI 9 teratocarcinoma cells, 
embryonic stem (ES) cells, AT-1, H9c2, QCE-6, or 10T1/2 cells, have some 
molecular characteristics of cardiomyocytes. These cells are very difficult to 
manipulate, however, or are lacking important characteristics of cardiomyocytes. 
Because; of these reasons, cultured neonatal cardiomyocytes from rats or mice are 
often used in in vitro systems, even though these cells are difficult to transfect 
(usually less than 0.1% transfection rate) and require long preparation procedures. 

Recently, it has been demonstrated that immortalized cardiac myogenic 
(CMG) cells can be differentiated from mouse bone marrow stromal cells. This is 
evidence of the generation of cardiomyocytes from a tissue of extra-cardiac origin. 

The possibility of bone marrow being an in vivo source of circulating 
cardiomyocyte progenitors has been previously suggested. A distribution of 



transplanted bone marrow-derived cells in a dystrophic mouse heart has been 
observed , Although the molecular characteristics of these cells were not 
identified, their location in the heart tissue indicated these cells were 
cardiomyocytes. Taken together, it appears that bone marrow stromal cells are an 
extra-cardiac source of cardiomyocytes in vivo, and in vitro induction of beating 
cardiomyocytes from a heterogeneous population of bone marrow cells is possible 
by the introduction of inductive agents such as 5-azacytidine. 

The molecular mechanisms which guide development of cardiac cells 
(and the heart in general) in vertebrates have been the subject of intense 
investigation (Fishman and Chien, Cell 91: 153-156, 1997; Olson and Srivastava, 
Science 272: 671-676, 1996). In most vertebrates, the heart tissue initially 
develops as a crescent shaped mesodermal structure located anteriorly and 
laterally. This precardiac mesoderm is brought ventrally and caudally, by folding 
of the embryo, to a form a single midline heart tube with the inflow region located 
most rostrally. This heart tube undergoes looping, bringing the inflow, ventricular, 
and outflow regions of the heart into the alignment seen in the mature heart. Later, 
chamber septation occurs, valves develop in the atrioventricular (AV) junction as 
well as in the outflow tract, and the outflow tract itself is divided into two great 
vessels. Additional refinements occur with the development of the coronary 
arteries and the cardiac conduction system. 

Heart development is governed by complex signals including inductive 
and positional signals from adjacent structures, as well as signals from a number of 
transcription factors (Fishman and Chien, Cell 91: 153-156, 1997; Lyons, Curr. 
Opin. Genet. Dev. 6: 454-460, 1996; Mohun and Sparrow, Curr. Opin. Genet. Dev. 
7: 628-633, 1997; Olson and Srivastava, Science 272: 671-676, 1996). Since 



transcriptional factors have the ability to activate multiple genes, they are generally 
considered important regulators of organ development. A number of cardiac 
transcription factors have been identified that have important influences on the 
early stages of specification and differentiation of the cardiac mesoderm (Tanaka 
5 et al., Dev. Genet. 22: 239-249, 1998). Csx/Nkx2.5 (Komuro and Izumo, Proc. 
Natl. Acad. Sci. USA 90: 8145-8149, 1993; Lints et al., Development 119: 419- 
431, 1993), MEF-2C (Edmondson et al, Development 120: 1251-1263, 1994), 
GATA4 (Heikinheimo et al., Dev. Biol 164: 361-373, 1994; Kelley et al., 
Development 118: 817-827, 1993) and dHAND and eHAND are members of four 

10 different classes of transcriptional factors all expressed in the heart at early stages 
of development. Targeted disruption of any one of these genes yields severe 
cardiac and extracardiac phenotypes, and results in death of the embryo between 
E9.5 and El 0.5 of development. 

The mouse Csx/Nkx2.5 gene is first expressed in the cardiac progenitor 

15 cells at embryonic day 7.5 (E7.5), and during this stage is detected principally in 
the hear!: and tongue (and, to a lesser extent, in spleen, stomach, liver, and larynx). 
The extra-cardiac expression of Csx/Nkx2.5 is markedly reduced after birth, 
however, and in the adult, the expression is confined to the heart. Mice in which 
the Csx/Nkx.2-5 gene has been deleted have no functional heart, causing an 

20 embryonic lethality by E9 . 5 - 1 1 . 5 . 

Most tissue-specific gene expression is controlled by enhancer and 
repressor sequences at the transcriptional level. Generally, to confer tightly- 
regulated expression, enhancers adopt complex regulatory mechanisms that require 
the collaboration of multiple transcription factors. The binding sites 

25 for these transcription factors may be many kilobases (kb) from the gene promoter 



and dispersed relative to each other. 

It is desirable to be able to express genes in a cardiac cell-specific 

manner. This would be useful, for example, for the targeted expression of genes 

encoding therapeutic proteins for the treatment of damaged heart tissue. 
5 Moreover, to maximize the utility of stem cell-derived cardiomyocytes, for 

example,, in the treatment of damaged heart tissue in humans and other animals, it 

is desirable to be able to rapidly purify cardiac cells from a potentially 

heterogenous cell population. 

Accordingly, there is a need for the development of reagents and 
10 methods for achieving cardiac cell-specific gene expression. The present 

invention provides these reagents and methods. 

Summary of the Invention 
In a first aspect, the invention features a substantially purified nucleic 
acid molecule comprising an enhancer element having: (a) 100% identity to 40 
15 contiguous nucleotides of the nucleic acid molecule shown in SEQ ID NO.: 1 or 
SEQ ID NO.: 3; (b) at least 91% identity to 50 contiguous nucleotides of the 
nucleic acid molecule shown in SEQ ID NO.: 2; (c) at least 97% identity to 60 
contiguous nucleotides of the nucleic acid molecule shown in SEQ ID NO.: 1 or 
SEQ ID NO.: 3; or (d) at least 95% identity to 70 contiguous nucleotides of the 
20 nucleic acid molecule shown in SEQ ID NO.: 1 or SEQ ID NO. : 3 . 

In a second related aspect, the invention features a substantially purified 
nucleic acid molecule comprising a cardiac-specific enhancer element derived 
from a human, wherein the enhancer element has at least 60% identity to 50 
contiguous nucleotides of the nucleic acid molecule shown in SEQ ID NO.: 1, 
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SEQ ID NO.: 2, or SEQ ID NO.: 3. Preferably, the element has at least 70% 
identity to 50 contiguous nucleotides of the nucleic acid molecule shown in SEQ 
ID NO.: 1, SEQ ID NO.: 2, or SEQ ID NO.: 3. More preferably, the identity is at 
least 80%, and most preferably, the identity is at least 90%, when compared to 50 
contiguous nucleotides of the nucleic acid molecule shown in SEQ ID NO.: 1, 
SEQ ID NO.: 2, or SEQ ID NO.: 3. 

Preferably, when expressed in vivo, the enhancer element is active in all 
four cardiac chambers. The enhancer element of the first or second aspect may be 
naturally occurring, or it may be non-naturally occurring. 

Preferably, the enhancer element of the first or second aspect includes a 
binding site selected from the group consisting of Mef2, dHAND, GATA, TGF-J3, 
CarG, E-box, and Csx/Nkx2.5 binding sites. More preferably, the enhancer 
element includes at least two binding sites selected from this group. The enhancer 
element preferably also includes an Sp-1 binding site. 

In a third aspect, the invention also features a substantially purified non- 
naturally occurring nucleic acid molecule that includes at least three transcription 
factor binding sites selected from Mef2, dHAND, GATA, TGF-p, CarG, E-box, and 
Csx/Nkx2.5 binding sites. More preferably, the nucleic acid molecule includes 
four transcription factor binding sites, and most preferably includes five 
transcription factor binding sites selected from the aforementioned group. 
Preferably, the nucleic acid molecule, when operably linked to a promoter, 
increases activity of the promoter by at least two-fold in a cardiac cell-specific 
manner. 

In a fourth aspect, the invention features a substantially purified nucleic 
acid molecule comprising an enhancer element having: (a) 100% identity to 50 



contiguous nucleotides of the nucleic acid molecule shown in SEQ ID NO.: 6; (b) 
at least 97% identity to 60 contiguous nucleotides of the nucleic acid molecule 
shown in SEQ ID NO.: 6; (c) at least 93% identity to 70 contiguous nucleotides of 
the nucleic acid molecule shown in SEQ ID NO.: 6; or (d) at least 90% identity to 
5 100 contiguous nucleotides of the nucleic acid molecule shown in SEQ ID NO.: 6. 

In a fifth aspect, the invention features a substantially purified nucleic 
acid molecule that includes a cardiac-specific enhancer element derived from a 
human, wherein the enhancer has at least 45% identity to 50 contiguous 
nucleotides of the nucleic acid molecule shown in SEQ ID NO.: 6. Preferably, the 
1 0 element has at least at least 50% identity to 50 contiguous nucleotides of the 

nucleic acid molecule shown in SEQ ID NO.: 6, more preferably, the element has 
at least 60% identity to 50 contiguous nucleotides of the nucleic acid molecule 
shown in SEQ ID NO.: 6, and most preferably, the element has at least 75% 
identity or even 90% identity to 50 contiguous nucleotides of the nucleic acid 
1 5 molecule shown in SEQ ID NO.: 6. The element may be naturally occurring or 
non-naturally occurring. 

In a fifth aspect, the invention features a substantially purified nucleic 
acid molecule comprising 50 contiguous nucleotides that have a sequence that is 
that at least 90% identical to 50 contiguous nucleotides of the nucleic acid 
20 molecule of SEQ ID NO.: 4 or SEQ ID NO.: 5. 

In a sixth aspect, the invention features a DNA vector that includes the 
nucleic acid molecule of the first, second, third, fourth, or fifth aspects. The DNA 
vector can also have a promoter operably linked to a gene of interest. The gene of 
interest is preferably a cardiogenic gene (e.g., a gene encoding BMP2, BMP4, 
25 GATA4, dHAND, eHAND, MEF2C, IRX4, SRF, or Csx/Nkx2 .5), a reporter gene 
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(e.g., a gene encoding GFP, p-gal, alkaline phosphatase, chloramphenicol acetyl 
transferase, or luciferase), a gene encoding a selectable marker (e.g., a gene that 
provides resistance to neomycin, kanamycin, or hygromycin), or a gene encoding a 
therapeutic protein (e.g., a growth factor, a cytokine, an anti-apoptotic factor, a 

5 pro-apoptotic factor, or a protein that improves cardiac function or repair). 

In a seventh aspect, the invention features a method for inducing a cell 
to become a cardiac cell. The method includes (a) introducing into the cell or 
ancestor thereof a DNA vector that includes (i) the nucleic acid of the first, second, 
third, fourth, or fifth aspect; (ii) a promoter; and (iii) a cardiogenic gene operably 

1 0 linked to the promoter; and (b) placing the cell under conditions that result in 
expression of the cardiogenic gene operably linked to the promoter. Preferably, 
expression of the cardiogenic gene further enhances expression of cardiogenic 
genes by binding to cardiac-specific enhancer elements. 

In an eighth aspect, the invention feature a method for specifically 

1 5 expressing a gene in cardiac cells, said method comprising introducing into the cell 
or ancestor thereof a DNA vector that includes (i) the nucleic acid of the first, 
second, third, fourth, or fifth aspect; (ii) a promoter; and (iii) the gene operably 
linked to the promoter. Preferably, the nucleic acid allows expression of the gene 
in a cardiac cell and does not express said gene in at least one cell that is not a 

20 cardiac cell. 

In a ninth aspect, the invention features a method for determining the 
efficacy of a method of inducing target cells to produce or become cardiac cells, 
the method including: (a) introducing into at least one target cell (or an ancestor of 
the target cell) a DNA vector that includes (i) the nucleic acid of the first, second, 

25 third, fourth, or fifth aspect; (ii) a promoter; and (iii) a reporter gene operably 
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linked to the promoter; (b) performing a method for potentially inducing the target 
cells to produce or become cardiac cells; and (c) determining the number or 
percentage of cells that are reporter gene-positive, wherein a higher number or 
percentage indicates a higher efficacy of the method of inducing stem cells to 
5 produce or become cardiac cells. Preferably, the target cells are stem cells such as 
bone marrow stem cells or embryonic stem cells. 

In a tenth aspect, the invention features a method for determining the 
efficacy of a method of inducing target cells to produce or become cardiac cells, 
the method including: (a) introducing into at least one target cell (or an ancestor of 
10 the target cell) a DNA vector that includes (i) the nucleic acid of the first, second, 
third, fourth, or fifth aspect; (ii) a promoter; and (iii) a gene, encoding a selectable 
marker, operably linked to the promoter; (b) performing a method for potentially 
inducing the target cells to produce or become cardiac cells;(c) performing a drug 
selection, wherein cells expressing said gene encoding the selectable marker are 
1 5 capable of surviving in the presence of the drug and cells not expressing the gene 
encoding the selectable marker are not capable of surviving in the presence of the 
drug; and (d) determining the survival of cells following drug selection, wherein a 
higher cell survival indicates a higher efficacy of the method of inducing stem 
cells to produce or become cardiac cells. In this method, step (b) can be performed 
20 before or after step (c). Preferably, the target cells are stem cells such as bone 
marrow stem cells or embryonic stem cells. 

In an eleventh aspect, the invention features a method of identifying a 
cardiac cell, including (a) introducing into the cell (or an ancestor of the cell) a 
DNA vector that includes (i) the nucleic acid of claim the first, second, third, 
25 fourth, or fifth aspect; (ii) a promoter; and (iii) a reporter gene operably linked to 
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the promoter, whereby the cell expresses the reporter gene if the cell is a cardiac 
cell; (b) allowing sufficient time for the reporter gene to be expressed in cardiac 
cells; and (c) identifying the cardiac cells by the presence of the reporter gene. 
Preferably, the method is performed in vitro. 

5 In a twelfth related aspect, the invention features a method of 

substantially purifying a cardiac cell from a heterogeneous population of cells, 
including: (a) introducing into at least a subset of cells in the population (or 
ancestors of these cells) a DNA vector that includes (i) the nucleic acid of the first, 
second, third, fourth, or fifth aspect; (ii) a promoter; and (iii) a reporter gene 

1 0 operably linked to the promoter, whereby a cell expresses the reporter gene if the 
cell is a cardiac cell; (b) determining whether a cell in the heterogeneous 
population is expressing the reporter gene; and (c) if the cell is expressing the 
reporter gene, purifying the cell from the heterogeneous population. 

In a thirteenth aspect, the invention features a method of expressing a 

1 5 gene encoding a therapeutic protein in a cardiac cell. The method includes 

introducing to the cell (or an ancestor of the cell) a DNA vector that includes (i) 
the nucleic acid of the first, second, third, fourth, or fifth aspect; (ii) a promoter; 
and (iii) a gene encoding a therapeutic protein operably linked to the promoter 
such that the gene is expressed in cardiac cells. The cell may be a cardiac cell, or 

20 it may be a cell that is capable of differentiating as a cardiac cell. The method may 
be performed in vivo or in vitro. If the method is performed in vitro, the cell (or a 
descend ent of the cell) can be grafted into a patient. 

In all of the foregoing aspects of the invention, an enhancer element is 
defined as a nucleic acid sequence that when present, (i) increases in a cardiac cell 

25 expression of a gene to which it is operably linked by at least 25%, (ii) allows for 
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gene expression in the heart tube prior to looping, (iii) allows for gene expression 
in all four heart chambers; or (iv) increases cardiac expression 100% more than it 
increases extracardiac expression. Preferably, in (i), the expression is increased by 
at least 50%, more preferably by 100%, and most preferably by 200% 

As used herein, by "nucleic acid" is meant either DNA or RNA. A 
"nucleic acid molecule" may be a single-stranded or double-stranded polymer of 
deoxyribonucleotide or ribonucleotide bases. Unless otherwise specified, the left 
hand direction of the sequence of a single-stranded nucleic acid molecule is the 5' 
end, and the left hand direction of double-stranded nucleic molecule is referred to 

as the 5' direction. 

By "promoter" is meant a region of nucleic acid, upstream from a 
translational start codon, which is involved in recognition and binding of RNA 
polymerase and other proteins to initiate transcription. A "human promoter" is a 
promote* capable of initiating transcription in a human cell, and may or may not be 
derived from a human cell. A "Csx/Nkx2.5 promoter" is one derived from the 
promoter region of a Csx/Nkx2.5 gene and that, when operably linked to a 
heterologous nucleic acid molecule, is capable of initiating transcription of that 
molecule (when present in a transcription medium capable of supporting 
transcription) in a cardiac cell. 

By "enhancer element" is meant a nucleic acid sequence that, when 
positioned proximate to a promoter and present in a transcription medium capable 
of supporting transcription, confers increased transcription activity relative to the 
transcription activity resulting from the promoter in the absence of the enhancer 
domain. 

By "operably linked" is meant that two or more nucleic acid molecules 
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(e.g., a nucleic acid molecule to be transcribed, a promoter, and an enhancer 
element) are connected in such a way as to permit transcription of the nucleic acid 
molecule in a suitable transcription medium. 

By "derived from" is meant that a the nucleic acid molecule was either 
5 made or designed from a second nucleic acid molecule, the derivative retaining 
important functional features of the nucleic acid molecule from which it was made 
or designed. 

By "expression construct" is meant a nucleic acid molecule that is 
capable of directing transcription. An expression construct of the present 
10 invention includes, at the least, a cardiac-specific enhancer element and a 
m promoter. Additional elements, such as a transcription termination signal, may 

'""tH 

QH also be included, as described herein. 

By "vector" or "expression vector" is meant an expression system, a 
^! nucleic acid-based vehicle, a nucleic acid molecule adapted for nucleic acid 

L 15 delivery, or an autonomous self-replicating circular DNA (e.g., a plasmid). When 
f* a vector is maintained in a host cell, the vector can either be stably replicated by 

*D the cells during mitosis as an autonomous structure, incorporated within the 

N j genome of the host cell, or maintained in the host cell's nucleus or cytoplasm. 

By "cardiac cell" is meant a differentiated cardiac cell (e.g., a 
20 cardiomyocyte) or a cell committed to producing or differentiating as a cardiac cell 
(e.g., a cardiomyoblast or a cardiomyogenic cell). 

By "cardiac-specific enhancer element" is meant an element, operably 
linked to a promoter, that directs gene expression in a cardiac cell and does not 
direct gene expression in all tissues or all cell types. Cardiac-specific enhancers of 
25 the present invention may be naturally occurring or non-naturally occurring. One 
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skilled in the art will recognize that the synthesis of non-naturally occurring 
enhancers can be performed using standard oligonucleotide synthesis techniques. 

By "plasmid" is meant an autonomous DNA molecule capable of 
replication in a cell, and includes both plasmids designed for expression and 
5 plasmids designed for nucleic acid replication. 

By "heterologous" is meant that the nucleic acid molecule originates 
from a foreign source or, if from the same source, is modified from its original 
form. Thus, a "heterologous promoter" is a promoter not normally associated with 
the duplicated enhancer domain of the present invention. Similarly, a heterologous 
1 0 nucleic acid molecule that is modified from its original form or is from a source 
different from the source from which the promoter to which it is operably linked 
was derived. 

By "substantially pure nucleic acid" is meant nucleic acid that is free of 
the genes which, in the naturally-occurring genome of the organism from which 

1 5 the nucleic acid of the invention is derived, flank the nucleic acid. The term 

therefore includes, for example, a recombinant nucleic acid which is incorporated 
into a vector; into an autonomously replicating plasmid or virus; or into the 
genomic nucleic acid of a prokaryote or a eukaryote cell; or which exists as a 
separate; molecule {e.g., a cDNA or a genomic or cDNA fragment produced by 

20 PCR or restriction endonuclease digestion) independent of other sequences. It also 
includes a recombinant nucleic acid which is part of a hybrid gene encoding 
additional polypeptide sequence. 

By "transgene" is meant any piece of a nucleic acid molecule (for 
example, DNA) which is inserted by artifice into a cell either transiently or 

25 permanently, and becomes part of the organism if integrated into the genome or 
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maintained extrachromosomally. Such a transgene may include a gene which is 
partly or entirely heterologous (i.e., foreign) to the transgenic organism, or may 
represent a gene homologous to an endogenous gene of the organism. 

By "transgenic cell" is meant a cell containing a transgene. For 
example, a stem cell transformed with a vector containing the expression vector of 
the present invention operably linked to a heterologous nucleic acid molecule can 
be used to produce a population of cells having altered phenotypic characteristics. 

Other features and advantages of the invention will be apparent from the 
following description of the preferred embodiments thereof. 

Brief Description of the Drawings 

Fig. 1 is a schematic illustration showing the structure of the mouse 
Csx/Nkx2.5 (mCsx/Nkx2.5) locus constructs utilized for transgenic experiments. 
Open boxes and shaded boxes represent untranslated sequences and coding 
sequence, respectively. Various subfragments of these two X phage clones were 
utilized for construction of transgenes (CsxlacZ-1 through CsxlacZ-6). 

Fig. 2 is a schematic illustration showing a representation of lacZ 
expression pattern seen in the heart with the mCsx/Nkx2.5 constructs tested. 

Fig. 3 A is an alignment of human Csx/Nkx2.5 (hCsx/Nkx2.5) and 
rnCsx/Nkx2.5 and sequence approximately 9 kb 5' to the ATG of exon 1. Shaded 
area indicate the location of homology domains Al (top) and A2. The location of 
transcription factor binding sites is also indicated. 

Fig. 3B (top sequence) is an alignment of mCsx/Nkx2. 5 and hCsx/Nkx2.5 
sequence approximately 3 kb 5 ? to the ATG of exon 1. Within this sequence is 
homology domain B (SEQ ID NO.: 6). 
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Fig. 3C is a schematic illustration showing the genomic organization of 
mCsx/Nhc2.5 and hCsx/Nkx2.5. Homology domains A-D are indicated. Note that 
in homology domain A, the human sequence is interrupted by sequence that has no 
identity to the mCsx/Nkx2.5 sequence. 
5 Fig. 4A shows the nucleotide sequence for SEQ ID NO.: 4, -7.5 kb of 

hCsx/Nkx2.5 genomic sequence, the genomic location of which is shown on Fig. 6. 

Fig. 4B shows the nucleotide sequence for SEQ ID NO.: 5, -6.8 kb of 
hCsx/Nkx2.5 genomic sequence, the genomic location of which is shown on Fig. 6. 

Fig. 5 A shows the nucleotide sequence of mCsx/Nkx2.5 homology 
10 domains Al (SEQ ID NO.: 1) and A2 (SEQ ID NO.: 2). 

Fig. 5B shows the nucleotide sequence of hCsx/Nkx2. 5 homology 
domains Al and A2 and the intervening sequence that has no counterpart in the 
mCsx/Nkx2.5. 

Fig. 5C shows the nucleotide sequence of hCsx/Nkx2. 5 homology 
1 5 domains B (SEQ ID NO. : 6). 

Fig. 6 is a schematic illustration showing the transgenic constructs of the 
hCsx/Nhc2.5 enhancer elements that were tested for cardiac-specific expression. 
The locations of SEQ ID NO.: 4 and SEQ ID NO.: 5 are also shown. 

Fig. 7 is a table summarizing the transgenic analysis of the hCsx/Nkx2.5 
20 enhancer elements. The constructs correspond to those shown in Fig. 6. 

Fig. 8 is a series of photographs showing transgene expression in mice 
that resulted from the indicated enhancer-hsp68-/acZ constructs. 

Fig. 9 is a series of photographs showing transgene expression in mice 
that resulted from the 7.5 kb enhancer-hsp68-/acZ constructs. 
25 Fig. 10 is a schematic illustration showing various uses of cardiac- 
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specific enhancer elements for facilitating the production or purification of cardiac 
cells. 

Detailed Description of the Invention 

5 We have cloned cardiac enhancers from hCsx/Nkx2J and m,Csx/Nhc2.5 

that, when used to drive transgene expression in mice, recapitulate expression 
patterns of the endogenous mCsx/Nhc2.5. One of the hCsx/Nkx2.5 enhancer 
constructs, 7.5 kb-hsp-/acZ, constructed by operably linking 7.5 kb of genomic 
region from hCsx/Nkx2.5 (SEQ ID NO.: 4) to an hsp68 promoter-/acZ cassette, is 

10 active as; early as E7.5, the earliest time point of cardiac development, and 

recapitulates the expression pattern of endogenous mCsx/Nhc2.5 at E9.5, E10.5, 
and E12.5. Thus, when the four chambers of the heart are formed at E12.5, the 
enhancer is active in all four chambers. Among the mammalian cardiac enhancers 
known so far, this 7.5 kb enhancer is the earliest enhancer that is active in all four 

15 chambers. Moreover, this enhancer displays no ectopic expression. 

Within this 7.5 kb fragment, we have further isolated two regions 
(referred to herein as homology domain Al (HDA1; SEQ ID NO.: 1) and 
homology domain A2 (HDA2; SEQ ID NO.: 2) that together, when operably 
linked to an hsp68 promoter-/acZ cassette, are capable of enhancing gene 

20 expression in a cardiac-specific manner. 

Prior art methods of CMG cell isolation are time consuming and 
produce low cell yields. Generally, cardiomyocytes are detected by their beating. 
The proportion of beating cardiomyocytes from primary bone marrow stem cells is 
very low, and even after immortalized CMG cells are established, approximately 

25 70% of cells derived from them are not beating. The failure to achieve a high 
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percentage of beating cardiomyocytes may be due to the incomplete induction of 
the cardiomyocyte phenotype by 5-azacytidine. Due to the extended time period 
currently required to isolate CMG cells, it was previously difficult to improve 
cardiomyocyte induction methods. Beating is one of the last phenotypes to be 

5 exhibited by cardiomyocytes. The present invention provides an earlier marker of 
cardiomyocyte induction. This is useful, for example, for optimizing 
cardiomyocyte induction methods. By reducing the length of time required for 
testing, one will be able to more rapidly improve the efficacy of induction of the 
desired (jells. The present invention also provides methods for enriching for 

10 cardiomyocytes by, for example, sorting based on expression of a reporter gene 
operably linked to a Csx/Nkx2.5 cardiac cell-specific enhancer. These and other 
uses are described in greater detail in the examples. 

F.xample 1 : Characterization of mCsx/Nhc2.5 enhanc er elements 

As an initial step in defining the regulatory region of Csx/Nhc2.5, we 

1 5 determined the 5' end of the mCsx/Nkx2. 5 transcript by 5* RACE with total RNA 
from hearts of three- week-old mice. Three different transcripts which were 
identical for most of their length, except for distinct 5' ends, were identified 
(transcripts I, II, and III). The sequence comparison with the genomic DNA 
sequence showed exon - intron boundaries in accordance with the consensus exon 

20 - intron junction sequences. Therefore, we named the most 5' cDNA segment exon 
la, the most 3' cDNA segment exon lb, and the intervening genomic sequence 
intron 1. Likewise, the exon that contains the start codon is termed exon lc, 
flanked by introns 2 and 3, and followed by the last exon, exon 2, which includes 
the homeodomain (HD) and TAG translation stop codon. 
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Transcript I contains two segments of cDNA at its most 5' end which are 
discontinuous in the genomic sequence. The most 5' segment of transcript I is 46 
bp in length (exon la) and is 669 bp upstream from the second discontinuous 
segment . The second discontinuous segment is 36 bp long (a part of exon lb) and 

5 3504 bp upstream from the translation start codon. For transcript II, exon lb is the 
most 5' sequence and is extended more 5' than in transcript I, in this case 93 bp 
long, including the 36 bp internal fragment found in transcript I. In transcript III, 
the first exon at the 5' end, exon lc, includes 71 bp of continuous genomic 
sequence which is spliced out in transcripts I and II. 

10 To confirm the existence of the novel exons, we performed primer 

extension analysis. With adult mouse heart poly(A) RNA, two bands were 
observed which correspond to the sizes of transcripts II and III. The band 
representing transcript III was much stronger, suggesting that it is the major site of 
initiation. Since we could not detect a band corresponding to the transcript I by 

1 5 primer extension analysis, we performed RT-PCR using total RNA samples from 
several tissues. With exon la-specific and exon lc-specific primer pairs, RT-PCR 
generated products of the predicted size (1 12 bp) from the heart and spleen. No 
band was amplified with RNA from kidney, which does not express Csx/Nkx2.5. 
By Southern blotting, this 1 12 bp band was shown to include exon lb using an 

20 exon lb-specific oligonucleotide probe. 

To further characterize the genomic structure of the mCsx/Nkx2.5 gene, 
we subcloned and sequenced 8.6 kb of genomic DNA. The 3' end of exon la is 
located 4265 bp upstream from the ATG codon, followed by intron 1 (668 bp), 
exon lb (93 bp), intron 2 (3179 bp), exon lc (655 bp) that includes the ATG 

25 translation start site, intron 3 (1377 bp), and exon 2 (983 bp) with TAG stop codon 
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and a poly(A) addition signal. The genomic sequences around the three putative 
transcription initiation sites show no TATA box, indicating that mCsx/Nkx2J is a 
TATA-less gene. 

3.3 and 4 kb of 5 f flanking sequence drive lacZ expression in the outflow 
tract and basal portion of the right ventricle as well as in the pharynx, 
thyroid primordium, and stomach 

Since the major mCsx/Nkx2.5 transcript (transcript III) starts 325 bp 
upstream from the ATG codon, we searched for enhancer elements in the genomic 
region immediately upstream to exon lc. The 3.3 kb of 5' flanking sequence 
between the Notl site and the ATG codon was fused to the lacZ reporter gene 
(CsxlacZ-1; Fig. 1) and injected into mouse embryos to generate transgenic mice. 
Of 40 embryos examined, eight embryos carried the transgene and three were 
noted to have P-gal staining. When the lacZ expression was assessed in these 
embryos at E10.5, we found that this 3.3 kb upstream region could drive lacZ 
expression in cardiac and extracardiac tissues. Myocardial cells in the outflow 
tract and the basal part of the right ventricle were strongly positive for lacZ, and 
there were also a few positive cells in the trabecular layer of the right ventricle. In 
the outflow tract, lacZ was strongly expressed in the myocardium, but no lacZ 
expression was detected in the aortic sac or endocardial cushions. 

At E10.5, extracardiac expression of lacZ was also observed in locations 
where the endogenous mCsx/Nkx2.5 was detected, such as the pharyngeal floor, 
thyroid primordium, and in the distal part of the stomach. In the stomach, lacZ 
expression was observed in mesenchymal cells. Ectopic expression of lacZ was 
observed in the surface ectoderm of pharyngeal arches and the laryngotracheal 
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groove. The lacZ expression pattern in both cardiac and extracardiac tissues was 
substantially identical in all three /acZ-positive embryos, with the exception of an 
ectopic expression in the glossopharyngeal ganglions in one embryo. 

The CsxlacZ-2 construct included the same 3.3 kb of 5* flanking 

5 sequence plus 1.4 kb of intron 3 sequence (Fig. 1). Of 34 embryos, six carried the 
CsxlacZ -2 transgene and, of these, three had positive P-gal staining. Cardiac 
expression was seen in the basal portion of the right ventricle and in the outflow 
tract, ami extracardiac expression was seen in the pharyngeal floor, the thyroid 
primordium, and the stomach in two embryos. In one embryo, lacZ expression 

1 0 was noted in the pharyngeal floor but not in other sites. This lack of expression 
was probably due to a negative effect of the transgene integration site. 

The addition of most of intron 1 and exon lb to the CsxlacZ-1 construct 
yielded CsxlacZ-3 (Fig. 1). Two embryos were positive for CsxlacZ-3 transgene, 
with one embryo having positive P-gal staining. The lacZ expression was observed 

15 in the outflow tract, right ventricle, pharynx, thyroid primordium, and stomach. 
Thus, the cardiac P-gal staining pattern was quite similar for constructs CsxlacZ-1, 
CsxlacZ-2, and CsxlacZ-3. The extracardiac expression pattern was also very 
similar except for the sites of ectopic expression. 

CsxlacZ-4 contains 6 kb of 5' flanking sequence. Twenty-two injected 

20 embryos were analyzed, and ten found to carry the CsxlacZ-4 transgene. 

Interestingly, none of the CsxlacZ-4 transgenic embryos showed any p-gal staining 
in the heart nor in any extracardiac tissues. This finding indicates that a strong 
negative regulatory element exists between 6 kb and 4 kb 5' to exon lc (Fig. 2). 
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A cis-regulatory element for expression of mCsx/Nkx2.5 in the medial 
wall and inner trabeculae of the right and left ventricles exists between 
14 and 6 kb of 5 'flanking sequence. 

With 14 kb of 5 ! flanking sequence driving the reporter gene (CsxlacZ- 
5 5), lacZ was expressed more widely in the heart than with CsxlacZ- 1 . P-gal staining 
was observed in the entire right and left ventricles, except for the compact layer of 
the lateral walls. In the atrium, there was a cluster of /acZ-positive myocardial 
cells in the interatrial groove, which is above the future atrial septum, p-gal staining 
was also seen in the outflow tract as in CsxlacZ- 1. Cardiac staining was similar in 
10 all four transgenic embryos with positive P-gal staining. 

Extracardiac expression of CsxlacZ-5 was observed in the pharyngeal 
floor and thyroid primordium in all four CsxlacZ-5 transgenic embryos. Only one 
of the four CsxlacZ-5 embryos was noted to have weak p-gal staining in the 
stomach, in contrast to strong p-gal staining in the stomach of transgenic embryos 
15 harboring CsxlacZ-1, -2, or -3 constructs. No ectopic expression was noted in any 
of the CsxlacZ-5 embryos, indicating the presence of repressor elements that 
restrict expression of mCsx/Nkx2. 5 in a tissue-specific manner. 

An enhancer for the right ventricle is located within 6 kb of the 3 f 
flanking sequence 

20 To examine cis-regulatory elements downstream of the coding sequence 

of mCsx/Nkx2.5, we linked 6 kb of 3' flanking sequence to the CsxlacZ- 1 construct 
to create CsxlacZ-6 (Fig. 1). Of 38 embryos examined, 12 carried the CsxlacZ-6 
transgene, of which 6 were /acZ-positive. Three transgenic embryos had p-gal 
staining in the entire right ventricle, including the compact layer of the lateral 
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walls. The other three had no cardiac staining, but did have weak extracardiac 
staining. Extracardiac staining in all six embryos included the stomach, pharynx, 
and thyroid primordium. 



Different expression patterns of the reporter gene in adult transgenic 

5 mice 

In addition to the evaluation of transient transgenic embryos described 
above, lines of transgenic mice were created and Fl transgenic mice were 
evaluated at different time points in development. 

In the transgenic CsxlacZ-1 line, which carried the 3.3 kb 5' flanking 

10 sequnece, no |3-gal staining could be observed at E7.5. At E9.5, the pattern of 
cardiac and extracardiac lacZ expression was identical to that seen in the transient 
transgenic embryos. At El 5.5, lacZ expression was also observed in the spleen, 
but not in the tongue. In the E15.5 heart, lacZ expression was reduced, but 
detectable in the outflow tract region of the right ventricle, while the transgene 

15 expression in the thyroid gland was still strong. Interestingly, when adult animals 
of the same transgenic CsxlacZ-1 line were analyzed, 0-gal staining was completely 
absent in the hearts. 

In the transgenic CsxlacZ-5 line, which carried the 14 kb 5' flanking 
sequence, |3-gal staining was observed in the cardiac crescent at E7.5, in the 

20 common ventricle and outflow tract at E8.25, and in the right ventricle, left 

ventricle, septum, and outflow tract at E9.5. At E15.5, the cardiac p-gal staining 
was significantly down-regulated, except for patchy expression in the AV junction 
and in the interventricular septum. The spleen was also stained, but the tongue 
was not at E15.5. Analysis of adult CsxlacZ-5 transgenic mice showed patchy p-gal 
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staining along the luminal surface of the right ventricle, particularly along the 
ventricular septal surface. P-gal staining was also seen in a small part of the left 
ventricle: base and in the AV junction area. 

Autoregulation of mCsx/Nkx2.5 expression 

One of the mechanisms to ensure tissue-specific expression of a 
transcription factor is a positive autoregulation of its own promoter, as has been 
shown for the MyoD gene. The mCsx/Nhc2.5 promoter contains multiple NKE 
elements, the binding sites for Csx/Nkx2.5 protein (Chen and Schwartz, J. Biol. 
Chem. 270:15628-15633, 1995). In order to determine whether positive 
autoregulation of mCsx/Nkx2,5 is present, we examined lacZ expression in mice 
homozygous null mutant for rnCsx/Nkx2.5 created by homologous recombination. 
Since lacZ expression is under the control of 5' and 3' regulatory regions of 
mCsx/NI<x2.5 in the mutant embryos, p-gal staining will be weaker or absent in 
homozygous null embryos if positive regulation by mCsx/Nkx2.5 is present. 
Unexpectedly, P-gal staining was far stronger in the homozygous mutant embryos 
than in the heterozygous mutant embryos stained simultaneously. These data 
indicate that a negative feedback, either directly or indirectly, of mCsx/Nkx2. 5 
exists, since the intensity of P-gal staining in the homozygous mutant heart was 
much more than double that in the heterozygous mutant heart. To confirm this 
result, we performed semi-quantitative RT-PCR using RNA extracted from the 
heart of heterozygous and homozygous mutant embryos at E9.5. Transcripts for 
lacZ in homozygous mutant hearts were approximately 8-fold those in 
heterozygous mutant hearts, after adjustment using the relative abundance of the 
transcript for a-cardiac actin. 
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The foregoing results were achieved with the following materials and 

methods. 

Isolation of genomic clones and sequencing 

Two lambda phage genomic DNA clones containing the mCsx/Nhc2.5 
5 coding and flanking sequences were isolated from a 129 mouse genomic library in 
AD ASH II. Clone 1 contained 14 kb of 5' flanking region in addition to a portion of 
the mCsx/Nkx2J coding region. Clone 2 contained 6 kb of 5 1 flanking region, the 
mCsx/Nioc2.5 coding region, and 6 kb of 3* flanking region. In addition, a mouse 
PI clone (Genome Systems, St. Louis, MO) which contained the mCsx/Nkx2.5 
10 coding sequence, as well as more than 10 kb 5 f and 3 f flanking sequence, was 

isolated. Restriction enzyme mapping and serial Southern blotting were performed 
using standard techniques. 

Definition of 5 ! end of mRNA 

5 f RACE was performed following previously described methods with 
15 some modifications (Reecy et al, Dev. Biol. 188: 295-31 1, 1997). The first strand 
cDNA synthesis reaction was performed using 5 jig total RNA from the heart of 
three- week old mice and 100 ng of random primers (Promega, Madison, WI) in a 
40 y\ volume containing 50 mM Tris-HCl (pH 8.3), 75 mM KC1, 3 mM MgCl 2 , 10 
\xM DTT, 40 U RNase inhibitor (Promega), 0.5 mM each deoxynucleotide, 200 U 
20 M-MLV reverse transcriptase (Life Technologies, Gaithersburg, MD) for 90 

minutes at 37°C. The cDNA was purified using a QIAQUICK™ PCR purification 
column (QIAGEN, Santa Clara, CA) after incubation with 5 jig RNase A 
(Ambion, Austin, TX). Ligation-anchored PCR was performed as described by 
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Ali Ansari-Lari et al. (BioTechniques 21: 34-36, 1996) with some modifications. 
Initially, 3 nmoles of a 3' ammo-modified 5' phosphorylated anchor primer (5- 
TCT CTA CTC CGA ATT CCG TCG TCC ACA CCT-3'; SEQ ID NO: 7) was 
end ligated to one half of the purified first strand cDNA using 20 U of T4 RNA 

5 ligase (New England Biolabs, Beverly, MA) in a 50 uL reaction volume 
containing 50 mM Tris-HCl (pH 8.0), 10 mM MgCl 2 , 1 mM hexamine cobalt 
chloride, 20 uM ATP, 25% PEG 800 and incubated for 24 hours at 16°C. The 
anchor-ligated cDNA was further purified using a QIAQUICK PCR purification 
column. The first round PCR was performed using one-tenth of the purified 

10 anchor-ligated cDNA, an anchor-specific primer (5'-AGG TGT GGA CGA CGG 
AAT TCG GAG TAG AGA-3*; SEQ ID NO: 8) and a Qx/Mx2.5-specific primer 
(5'-GGG GGC GGC TGG GAA AGC AGG AGA GCA CTT-3'; SEQ ID NO: 9). 
PCR conditions were as follows: denaturation at 94°C for one minute, annealing at 
60°C for one minute, extension at 72°C for two minutes with an additional five 

1 5 minutes incubation at 72°C at the end of the 30 cycles. Subsequent PCR reactions 
were performed using 5 ul of each PCR product, a nested anchor primer (5-CGA 
CGG AAR TCG GAG TAG AGA-3'; SEQ ID NO: 10), and one of the 
Csx/M*25-specific primers (5'-TTG AAG GCG GCC AGC ATG CAG GAG 
GCA-3' (SEQ ID NO: 1 1) or 5'- ACA GGA GCG ACG GGC AGT TCT GCG T-3' 

20 (SEQ ID NO: 12)) at the same PCR conditions as above. The PCR products were 
visualized on a 2% agarose gel with ethidium bromide, and subsequently 
subcloned and sequenced. 
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Detection of exons lb and lcby primer extension 
Five microgram samples of mouse heart poly (A) RNA were 
coprecipitated with 2 x 10 4 cpm end-labeled, gel-purified oligonucleotide probe 
(5'-CGG AGC ACC AGG GGC AGA AGA GGC-3 1 ; SEQ ID NO: 13). The 
5 samples were resuspended in 10 jllI annealing buffer (0.1 M NaCl, 0.01 M, pH 8.0, 
0.001 M EDTA), heated to 85°C for five minutes, the incubated at 37°C four two 
hours. Twenty microliters of reverse transcription buffer (10 mM DTT, 16 mM 
MgC12, 1 mM dNTP, 1 U/jxl RNasin, 0.1 M Tris, pH 8.0) was added to each 
sample as well as 20 U Superscript™ reverse transcriptase (Life Technologies). 
10 Reverse transcription was allowed to proceed at 40°C for one hour, and 7.5 (ill of 
each sample was mixed with an equal volume of formamide loading buffer. After 
denaturation at 95°C for five minutes, 7 \x\ of each sample was loaded onto an 8% 
denaturing polyacrylamide gel. After electropheresis, the gel was dried and 
exposed to film. 

15 Detection of exon la by RT-PCR 

RT-PCR was performed using rTth DNA polymerase (rTth RT-RNA 
PCR kit; Perkin Elmer, Branchburg, NJ) following the manufacturer's protocol. 
Reverse transcription was performed using an exon Ic-specific primer (5 f -ACA 
GGA GCG ACG GGC AGT TCT GCG T-3*; SEQ ID NO: 14), and the subsequent 

20 PCR reaction was performed by adding an exon 1 a-specific primer (5 ! -GAG TGC 
TCT GCC TGA TGA TC-3'; SEQ ID NO: 15) to the RT reaction according to the 
manufacturer's instruction. The PCR reaction consisted of 35 cycles of 
denaturation at 95°C for ten seconds, annealing/extension at 55°C for 15 seconds, 
and, at the end of the 35 cycles, an additional final extension at 55°C for seven 
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minutes. The PCR products were visualized on a 3% agarose gel with ethidium 
bromide, and a Southern analysis was performed using a 32 P end-labeled exon lb- 
specific primer (5'-CCA GTC TAG AAG CGG TGA TCG CCA-3'; SEQ ID NO: 
16). 

5 Construction of reporter gene constructs, generation and analysis of 

transgenic mice 

An Xbal-PstI lacZ cassette from pnlacF (Bonnerot et al., Proc. Natl. 
Acad. Sci. USA 84: 6795-6799, 1987) was subcloned into the and Pstl sites 
of pBluescript SK- (Stratagene) and regions of the genomic mCsx/Nkx2.5 DNA 

10 were cloned 5 f or 3 r to the lacZ cassette. The CsxlacZ-1 construct included 3.3 kb 
of genomic sequence, including most of intron 2 and the beginning of exon lc, 
between the Notl site and the ATG codon (Fig. 1). An Xhol-Pstl fragment 
containing part of exon lc, intron 3, and part of exon 2 was subcloned 3' to the 
lacZ gene in CsxlacZ-1 to make the construct CsxlacZ-2 (Fig. 1). The CsxlacZ-3 

1 5 construct contains a 4 kb fragment between the Spel site and the ATG codon of 
Csx/Nkx2.5, thus including a part of intron 1, exon lb, intron 2, and part of exon 
lc (Fig. 1). CsxlacZ-4 and CsxlacZ-5 contain 6 kb and 14 kb, respectively, of 
upstream 5' Csx/Nkx2.5 genomic sequence cloned upstream of the lacZ gene. A 6 
kb long 3' downstream genomic fragment was fused 3 ! to the lacZ gene in 

20 CsxlacZ-1 to create the construct CsxlacZ-6 (Fig. 1). 

For microinjection of CsxlacZ-1, -2, -3, and -6, mCsx/Nkx2J genomic 
sequence together with the lacZ cassette portion of each construct was excised 
from the vector using Xhol and Notl and purified by gel electrophoresis and 
Geneclean III (Biol 01, Vista, CA). In preparation for microinjection, CsxlacZ-4 
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and CsxlacZ-5 were linearized with^ol, then purified as indicated above. The 
creation of transgenic mice was done using standard methods (Hogan et al. (1994) 
Mani pulating the Mouse Embrvo . Cold Spring Harbor Laboratory Press, New 
York). 

For transient transgenic analysis, F 0 embryos were dissected at E10.5. 
Subsequent genotyping was performed using PCR on chromosomal DNA isolated 
from the: yolk sacs. PCR primer pairs used for detection of the transgenes were as 
follows: 5'-CCG TCC GAT GAA AAA CAG GAG-3' (SEQ ID NO: 17) and 5'- 
TCT GCT CTT CGT TGG CTG ATG-3' (SEQ ID NO: 18) for CsxlacZ-1, -2, and 
-3; 5'-CCG TCC GAT GAA AAA CAG GAG-3' (SEQ ID NO: 17) and 5'-TTA 
AGT TGG GTA ACG CCA GGG-3* (SEQ ID NO: 19) for CsxlacZ-4 and -5; and 
5'-AAC TTG CTA GGT AGA CTA GGC TGG C-3' (SEQ ID NO: 20) and 5'-TCT 
GCT CTT CGT TGG CTG ATG-3' (SEQ ID NO: 18) for CsxlacZ-6. 

Whole-mount P-gal staining was performed according to the method of 
Schlaeger et al. (Development 121: 1089-1098, 1995). After photographs were 
taken of the whole mounts, embryos were dehydrated through graded ethanol and 
xylene, embedded in paraffin, sectioned, and counterstained with Nuclear Fast Red 
(Vector Laboratories, Burlingame, CA). 

Generation of mouse lines carrying CsxlacZ-1 and CsxlacZ-5 
For CsxlacZ-1 and CsxlacZ-5, we established transgenic lines of mice. 
The F 0 mice carrying each transgene were backcrossed with FVB mice and Fj 
embryos were examined for lacZ expression. We identified one line of transgenic 
mice for each construct and analyzed them at different time points, including E7.5, 
E8.25, E9.5, E15.5, and adult. 
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Generation of Csx/NJoc2.5 knock-out and lacZ knock-in mice 

The entire coding region of mCsx/Nkx2 .5 was replaced with a lacZ- 

neomycin resistance gene (Neo) cassette by homologous recombination in a 

standard manner (Hogan et al., supra). 

In situ hybridization 

In situ hybridization was performed as described previously (Tanaka et 
al., Dev. Genet. 22: 239-249, 1998). Briefly, embryos were fixed in 4% 
formaldehyde at 4°C overnight and embedded in paraffin. Tissue sections were 
hybridized with a 35 S-labeled Csx/Nkx2.5 cRNA probe at 55°C, washed and treated 
with RNase A. After emulsion autoradiography, sections were counter-stained 
with hematoxylin and eosin. 

Example 2: Isolation of hCsx/Nkx2.5 promoter and cardiac-specific elements 

We have cloned an approximately 20 kb-long genomic clone containing 
about 17 kb upstream flanking sequence, as well as the first exon and a part of 
intron of the hCsx/Nkx2.5, from a Lambda FIXII genomic library (Stratagene) 
using radiolabeled hCsx/Nkx2.5 cDNA probes. To narrow down the cardiac 
enhancer, various regions of the upstream flanking sequence from the 20 kb clone 
were fused to the hsp68 promoter-linked lacZ reporter, hsp68-/acZ (Kothary et al., 
Development 105: 707-714 1989) (Figs. 3A-3C, 6). The expression patterns of 
these constructs were tested using transgenic mice to define the enhancer region of 
the cardiac specificity. 

Each construct was linearized at immediate 5 r end of the subcloned 
flanking; sequence, phenol: chloroform (1:1) and chloroform extracted one time 
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each, alcohol precipitated, and filtered using a 0.22 um pore size microfilter unit 
(Eppendorf 5 prime, Boulder, CO) after resuspended with deionized water. For 
transient transgene analysis, F 0 embryos were collected at either E9.5 or E10.5, 
formaldehyde fixed, PBS washed, and examined for the expression of the p- 
5 galactosidase. Genotyping was performed using PCR on the chromosomal DNA 
from the; yolk sacs. After the whole body embryos were incubated at 30°C for 12- 
14 hours in X-gal containing buffer, photographs of the lacZ positive embryos 
were taken. Consequently, the lacZ staining positive embryos were paraffin 
embedded, sectioned, and counter stained using Nuclear Fast Red. Some of the 
10 transgenic mice were established in the whole embryos (at E7.5 and El 2.5) or in 

PI 

?|J the hearts (three days after birth) (Figs. 7-9). 

m The results of the transgenic study using the constructs described above 

li allowed us to identify an approximately 7.5 kb-long enhancer sequence which 

m showed expression patterns very similar to the rnCsx/Nkx2.5 throughout the 

L 15 embryonic development; at E7.5 in the cardiac crescent, at E9.5 in the first 
^ branchial arch and the heart, and at E12.5 in the four chambers of the heart 

ffi including the outflow tract. There was a weak expression of the lacZ also in the 

f stomach region at E12.5. In the three day-old mouse, however, the enhancer 

activity of the 7.5 kb upstream flanking sequence of the hCsx/Nkx2.5 results in a 
20 expression pattern than that observed for the endogneous mouse gene. The 7.5 kb 
hCsx/Nkx2.5 enhancer-hsp68 promoter-/acZ is active in the right ventricle, but not 
in the atria or the left ventricle. 

We sequenced the 7.5 kb enhancer region of hCsx/Nkx2.5 (Figs. 4A and 
4B), and found regions of high sequence homology between the mouse and the 
25 human Csx/Nhc2.5 enhancers (Figs. 3A-3C, 5 A, 5B). When the homologous 
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regions of the hCsx/Nkx2.5 cardiac enhancer (homology domains Al and A2) were 
separated from each other, neither part showed enhancer activity. When they were 
religated by removing the intervening non-homologous sequence, however, the 
cardiac enhancer activity was regained, implying that both of the homologous 
5 regions are required for the cardiac enhancer activity. In these homologous 

sequences, we found several putative transcription factor binding sites, which are 
important in cardiac development, such as GATA4, MEF2C, dHAND, SRE, E- 
box, Sp -1, TGF-b responsive element, and Csx/Nkx2.5 binding sites. 

Example 3: Use of cardiac-specific enhancer elements in the differentiation and 
and purification of cardiac cells from stem cells 

Stem cells (e.g., embryonic stem (ES) cells and bone marrow stem cells) 
are capable of being induced into differentiating into cardiac cells. When mouse 
ES cells differentiate as cardiac cells, endogenous mCsx/Nkx2.5 is expressed. 
Thus it is highly likely that the enhancer elements of the present invention will also 
be active in stems cells of all types that differentiate as cardiac cells (Fig. 10). 

Optimizing induction of cardiac cells 

As is described above, current methods for inducing the differentiation 
of cardiac cells from stem cells have a low efficacy. Thus, in one aspect, the 
invention features a method for optimizing the efficacy of cardiac cell induction. 
20 In one example, a population of stem cells are transformed with a DNA vector that 
includes a reporter gene (e.g., a gene encoding GFP, p-galactosidase, or alkaline 
phosphatase) operably linked to a promoter and a cardiac-specific enhancer. 
Cardiac' cell differentiation is then induced by one or more induction methods. 
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Reporter gene expression is determined at the appropriate time: methods that result 
in a large number or percentage of reporter-positive cells are identified as suitable 
methods for the induction of cardiac cells. Because reporter gene expression will 
precede the beating phenotype currently used for scoring cardiac cell induction, the 
present method can be performed in a shorter time period and at a lower cost than 
previous methods. 

Selecting for or sorting out cardiac cells 

As the number of cardiac cells that differentiate from stem cells can very 
low, the; invention also features a method of selecting for cardiac cells (and thereby 
eliminating non-cardiac cells). In one example of this method, a gene encoding a 
selectable marker (e.g., neomycin) is operably linked to a cardiac cell enhancer- 
promoter construct. This construct can be introduced into stem cells or, 
alternatively, used to produce a transgenic animal (e.g., a transgenic mouse or a 
transgenic pig), from which stem cells are then isolated. Following induction of 
cardiac cell differentiation, selection is performed for the appropriate time. During 
selection, non-cardiac cells die, resulting in an enrichment for cardiac cells. 

It is likely that some stem cells express Csx/Nkx2.5 even prior to 
induction. These cells may have a greater capacity for producing cardiac cells. 
Accordingly, in another example, the selection step described above is performed 
prior to the induction step. 

In a related aspect, the invention features a method for separating 
cardiac cells or stem cells having a greater capacity for producing cardiac cells. In 
this example, the construct is as described above, except that the gene encoding a 
selectable marker is replaced with a reporter gene that allows for identification of 
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living cells (e.g., GFP). Reporter gene-positive cells can be identified and 
separated from reporter gene-negative cells using any appropriate method (e.g., 
fluorescence-activated cell sorting). Again, as in the previous method, the sorting 
step can be performed prior or subsequent to the induction step. 

Augmenting cardiac cell differentiation 

Several genes (e.g., BMP2/4, the Wnt family of genes, dHAND, eHAND, 
MEF-2Q GATA4, SRF,p300, IRX4 and Csx/Nkx2.5) have been implicated in 
cardiogenesis. Some of these cardiogenic genes are expressed earlier than 
Csx/Nb:2.5, while others are concomitant or later that Csx/Nkx2.5. Regardless of 
their temporal expression, it is likely that expression of one of the foregoing genes 
under the control of a cardiac specific enhancer, such as those described herein, 
will result in an augmentation of cardiac cell differentiation (i.e., accelerated 
differentiation, an increase in the percentage of cardiac cells, or both). Therefore, 
the invention features methods for augmenting cardiac cell differentiation from a 
population of stem cells by expressing a cardiogenic gene under the control of a 
cardiac specific enhancer element. Preferably, the method also includes a step in 
which cardiac cell differentiation is induced (e.g., by introduction of 5- 
azacytidine). 

Example 4: Use of cardiac-specific enhancer elements for gene therapy 

The use of gene therapy to introduce genes encoding therapeutic 
proteins into cardiac cells may be carried out in vivo or ex vivo. Thus, in one 
aspect, the invention provides a method of modifying cardiac cells of a patient in 
vivo by inserting into the cardiac cells a DNA vector that includes the gene 
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encoding a therapeutic protein operably linked to a cardiac-specific enhancer of 
the invention. Nucleic acid molecules are provided in solution or in any other 
pharmacologically suitable form for administration. Preferably, the DNA vectors 
are administered directly to the heart, but can also be administered systemically. 
5 There are many delivery methods known in the art for enhancing the 

uptake of nucleic acids by cells. Useful delivery systems include the Sendai virus- 
liposome delivery systems (see Rapaport and Shai, J. Biol. Chem. 269: 15124- 
15131, 1994), cationic liposomes, polymeric delivery gels or matrices, porous 
balloon catheters (Shi et al, Circulation 90: 955-951, 1994; and Shi et al., Gene 

10 Therapy 1:408-414, 1994), intraluminal pressure (PCT/US96/06271, herein 
incorporated by reference), retrovirus expression vectors, and the like. 

The use of liposomes as delivery vehicles is one particular method of 
interest. The liposomes fuse with the cells of the target site and deliver their 
contents intracellularly. As noted above, liposomes are maintained in contact with 

15 the cells for a time sufficient for fusion, using various means to maintain contact, 
such as isolation, binding agents, and the like. Liposomes may be prepared with 
purified proteins or peptides that mediate fusion of membranes, such as those of 
the Sendai virus or influenza virus, etc. The lipids may be any useful combination 
of known liposome forming lipids, including, for example, cationic lipids, such as 

20 phosphatidylcholine. The remaining lipid will normally be neutral lipids, such as 
cholesterol, phosphatidyl serine, phosphatidyl glycerol, and the like. 

Liposomes may be prepared by the procedure described by Kato et al. 
(J. Biol. Chem. 266: 3361, 1991). This method allows for the incorporation into 
the lumen high molecular weight molecules, particularly nucleic acids of one 

25 kilobase pair or more. In this way DNA vectors may be introduced into cells 
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efficiently. 

In another aspect, cell populations (or organs) can be removed from the 
patient or donor animal, modified ex vivo by insertion of the DNA vector, and re- 
implanted into the patient or transplanted into another recipient. Methods for 
5 grafting cells into cardiac tissue are described, for example, in PCT publication 
WO 98/54301. In one example, stem cells are removed from a patient, their 
number expanded in culture, and then induced to differentiate as cardiac cells as 
described herein. At any time during the method, the DNA vector including the 
gene encoding the therapeutic gene may be introduced into the cells. The cells can 
10 transplanted into the patient or implanted into a different recipient of the same or 
different species. 

rf \ The donor species may be any species which is the same or different 

T Z from the recipient species, and which is able to provide the appropriate cells (e.g., 

stem cells capable of differentiating into cardiac cells). The donor may be of a 
^ 15 species which is allogenic or xenogenic to that of the recipient. Preferably, the 
M recipient is a mammal, e.g., a primate. Most preferably the recipient is human. 

Eft For human recipients, it is envisaged that human (i.e. allogenic) as well as pig (i.e. 

jT xenogenic) donors will be suitable, but any other mammalian species (e.g., bovine 

or non-human primate) may also be suitable as donors. For example, porcine 
20 aortic endothelial cells (PAEC), or the progenitor cells thereof, can be modified to 
express the gene encoding a therapeutic protein at effective levels, for grafting into 
a human recipient. 

Heterologous DNA can be inserted into the animal or an ancestor of the 
animal at the single-cell stage or early morula stage. The preferred stage is the 
25 single-cell stage although the process may be carried out between the two and 
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eight cell stages. A transgenic animal can thereby be obtained, which will pass the 
heterologous DNA onto its offspring. 

Gene transfer can also be performed in allografts using ex vivo 
transduction of cells prior to transplantation, or for xenotransplantation, where 
5 porcine transgenesis is established. Methods of preparing transgenic pigs are well 
known in the art, as discussed by Logan (Curr. Opin. Immunol. 12: 563-568, 2000) 
and the references cited therein. Any transgenic animal may be used in the present 
invention; pigs are particularly suitable because they are particularly amenable for 
xenotransplantation into a human recipient. Transgenic pigs may be produced by 

10 homologous recombination and other such techniques that destroy wild type gene 
function. By way of example, transgenic pigs may be produced utilizing 
homologous recombination techniques to produce a transgenic animal expressing 
the desired protein. 

For gene delivery, a variety of vectors or plasmids are available (see, 

15 Maniatis et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor, N.Y.; Ausubel et al., Current Protocols in Molecular Biology, Greene 
Publishing Associates, New York, V. 1&2 1996; Harlow and Lane Antibodies, A 
Laboratory Manual, Cold Spring Harbor Laboratory, 1988, incorporated by 
reference herein). The common vectors described therein are able to transiently or 

20 stably be maintained in smooth muscle cells, usually for a period of at least about 
one day, more usually for a period of at least about several days. 

Retroviral vectors (e.g., lentivirus), and in particular, replication-defective 
retroviral vectors lacking one or more of the gag, pol, and env sequences required 
for retroviral replication, are well-known to the art and may be used to transform 

25 endothelial or other mammalian cells. PA 3 17 cells or other producer cell lines 
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producing helper-free viral vectors are well-described in the literature. A 
representative retroviral construct comprises at least one viral long terminal repeat 
and promoter sequences upstream of the nucleotide sequence of the therapeutic 
substance and at least one viral long terminal repeat and polyadenylation signal 
5 downstr eam of the sequence. 

V ectors derived from adenoviruses, i.e. viruses that cause upper respiratory 
disease in humans and are present in latent infections in primates, are also 
generally known in the art and useful in the present invention. The ability of 
adenoviruses to attach to cells at low ambient temperatures is an advantage in the 
10 transplant setting which can facilitate gene transfer during cold preservation of 
tissues or organs. Adenoviral mediated gene transfer into vessels or organs by 

si 

: vj means of transduction perfusion is also a means of modifying cells in vivo. 

r r ] Prior to implantation into a recipient species, the treated cells may be 

jj s ; screened for modified cells containing and expressing the construct. For this 

^ 15 purpose, the DNA vector can also be provided with a second nucleotide sequence 
M« encoding an expression product that confers resistance to a selectable marker 

§1 substance. Suitable selection markers for screening include the neo gene, 

y : conferring resistance to neomycin or the neomycin analog, G41 8. Although any 

mammalian cell can be targeted for insertion of the gene, the preferred cells for 
20 manipulation are stem cells. 

All publications cited herein are hereby incorporated by reference. 
What is claimed is: 
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